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METHOD AND APPARATUS FOR MULTIPLEXING VOICE OVER ATM OR IP 
DATA COMMUNICATION CHANNELS 

TECHNICAL FIELD 
5 This invention relates to communications systems generally and, more 

particularly, to a method and apparatus for adaptively encoding and multiplexing audio 
data. 

BACKGROUND OF THE INVENTION 
10 Present techniques for incorporating audio data within existing network 

transport systems tend to provide high quality audio information at the expense of 
network capacity. For example, telecommunications networks typically utilize one 
digital signal level 1 (DS-1) signal path operating at 64 kilobits per second to transport 
one voice channel. While such voice data is of a relatively high quality or fidelity, the 



IjU 15 bandwidth or network capacity required to transport such voice data is relatively high. 

Q 

q Therefore, it is seen to be desirable to provide a method and apparatus that 

w balances the competing requirements of high perceived speech quality, high capacity, 

O moderate processing requirements and robust performance. More particularly, it is seen 

to be desirable to provide such voice data via a network transporting voice data and 

jfJ 20 non-voice data. 

O 

SUMMARY OF THE INVENTION 
The invention comprises a method and apparatus for providing, illustratively, 
voice data over an asynchronous transfer mode (ATM), Internet Protocol (IP) or other 
25 communications network where the voice data is preferentially encoded at a high 
fidelity level, formed into transport cells or packets, and multiplexed onto a 
communications channel with, optionally, data traffic. In response to network 
congestion or buffer over utilization, the voice data is encoded at a medium or low 
fidelity level to reduce the amount of encoded data necessary to represent the voice 
30 data. 

In one embodiment of the invention a plurality of encoders has associated with it 
a respective cell assembly buffer for forming the cells or packets and passing the ATM 
cells or packets to a transmit buffer. Each of the encoders is responsive to buffer 
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utilization levels of the transmit buffer and the respective cell assembly buffer to adapt 
the voice encoding from a preferred high fidelity level to a medium fidelity level or 
even a low fidelity level. 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

The teachings of the present invention can be readily understood by considering 
the following detailed description in conjunction with the accompanying drawing in 
which: 

FIG. 1 depicts a high level block diagram of an audio encoding and multiplexing 
10 system including the present invention; 

FIG. 2 depicts a high level block diagram of a controller suitable for use in the 
system of FIG. 1; 

FIG. 3 depicts a flow diagram of a first method of encoding according to the 
present invention; and 

W 15 FIG. 4 depicts a flow diagram of a second method of encoding according to the 

P present invention. 

To facilitate understanding, identical reference numerals have been used, where 
O possible, to designate identical elements that are common to the figures. 

3 — 

!5 20 DETAILED DESCRIPTION OF THE INVENTION 

Q 

O The subject invention will be described within the context of the adaptive 

encoding of a plurality of voice channels to form respective streams of asynchronous 
transfer mode (ATM) cells, internet protocol (IP) packets or other transport entities 
which are then multiplexed onto a communications link, such as a Tl link. It will be 

25 appreciated by those skilled in the art that different transport cell formats may be used, 
different communication links may be served, and different encoding techniques may be 
employed by those skilled in the art and informed by the present disclosure. 

FIG. 1 depicts a high level block diagram of an audio encoding and multiplexing 
system including the present invention. Specifically, each of a plurality of voice 

30 channels Vi through V N (collectively voice channels V) is coupled to a respective 

analog-to-digital (A/D) converter 110! through 110 N (collectively A/D converters 110). 
Each A/D converter 110 responsively provides a respective digitized voice signal Di 
through D N (collectively digitized voice signals D) to a respective encoder 120J through 
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120 N (collectively encoders 120). Each encoder 120 encodes its respective digitized 
voice signal D to produce a respective encoded output signal Ej through E^ (collectively 
encoded output signals E). Each encoded output signal E is coupled to a respective cell 
assembly buffer queue 132! through 132 N (collectively cell assembly buffer queues 
5 132). Together, the cell assembly buffer queues 132 form the cell assembly buffer 130. 
Each cell assembly buffer queue 132 provides at an output a respective formed cell or 
packet C x through C N (collected formed cells C). Each of the formed cells C is coupled 
to a ready to transmit buffer 140. It should be noted that the formed cells C may 
comprise ATM cells, IP packets or other transport entities. 
10 In the system 100 of FIG. 1, a quantity Q^t) provided by the cell assembly 

buffer 130 represents the utilization level or total amount of information (bits or 
t& fractional cells) in the cell assembly buffer 130 at time t. A quantity Q 2 (t) provided by 

\ji the ready to transmit buffer 140 represents the utilization level or total amount of 

Jil information in the ready to transmit buffer at the time t. Cells are formed by encoding 

W 15 the voice data streams and assembling cells or packets including encoded voice data (as 

□ described below). When a cell or packet is completely assembled, it is "popped" into 

^ the ready to transmit buffer 140. The cells or packets are normally served or "popped" 

O in a FIFO fashion. If two or more cells are served simultaneously, the served cells may 

u be ordered randomly, sequentially, or in a predetermined pseudo-random arrangement 

;~ 20 in the buffer. The ready to transmit buffer 140 transmits the stored cells or packets at a 

P rate of, illustratively, 1.536 Mbps in the case of the communications network 

comprising a Tl link. In the case of the communications network comprising a 
different type of link, other data rates are, of course, utilized. Header insertion may be 
done when the product is popped, or at any other time prior to transmission. 
25 In the system 100 of FIG. 1, each of the A/D converters 1 10 generates voice 

samples at a rate of 8,000 samples per second (i.e., 8 kHz). The 8 kHz sample rate is 
selected to conform to the normal bandwidth utilization in the existing telecom voice 
networks, where voice information up to a frequency of approximately 4 kHz may be 
carried. In the case of higher quality voice data, higher sampling rates for the A/D 
30 converters may be used. Each sample comprises 4 bits; namely, 2 high-priority bits, 1 
medium-priority bit, and 1 low-priority bit. The 2 high-priority bits are necessary to 
reconstruct the sample and comprise the two most significant bits of the 4 bit sample. 
The next significant bit comprises the medium priority bit, while the least significant bit 
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comprises the low-priority bit. The 4 bits are passed from each A/D converter 110 to 
the corresponding encoder 120. 

Each encoder 120 encodes the respective digitized voice samples D at, 
illustratively, three fidelity levels; namely, high fidelity, medium fidelity, and low 
5 fidelity. For purposes of this discussion, assuming 4 bit samples at 8 kHz from the A/D 
converters 110, high fidelity encoding comprises 32 Kbps encoding, medium fidelity 
encoding comprises 24 Kbps encoding, and low fidelity encoding comprises 16 Kbps 
encoding. This encoding is performed by simply taking all 4 bits of each sample (high 
fidelity encoding), the three most significant bits of each sample (medium fidelity 
10 encoding) or just the two most significant bits of each sample (low fidelity encoding). 
Thus, encoders 120 cause a "loss" in audio fidelity by deliberately deleting one or more 
of the least significant bits of the digitized voice stream D. 



'kQ The encoders 120 may delete bits, or cause fidelity loss, in response to the 



control signal CE produced by the controller 200 or, in the case of the encoder including 

W 15 the internal control logic, in response to a control signal indicative of network 

p 

O congestion, such as an over utilized cell assembly buffer 130 or an over utilized ready to 

transmit buffer 140. It is noted that a voice connection is perceived to be of high quality 
using 32 Kbps encoding, good quality at 24 Kbps encoding, but noticeably degraded 
and marginal at 16 Kbps encoding quality. 
20 With respect to the cell assembly buffer 130, each of the N queues 132 accepts 

samples from a respective encoder 120. Each cell or packet that is assembled includes a 
fixed overhead used for signaling the coding rate provided in that cell. That is, for each 
cell or packet (or, optionally, groups of packets), a fixed overhead of between 0 and 4 
bytes is included to identify the coding rate used such that a foreign decoder may 
25 appropriately decode the voice data. A 4 byte header is used in the ATM Adaptation 
Layer 2 (AAL2) standard defined and thee International Telecommunications Union- 
Telecommunication (ITU-TM) Standard i.363.2, which describe ATM Adaptation used 
for variable-byte-rate voice-over-ATM services. However, it is noted by the inventor 
that 4 bytes of overhead is unduly wasteful within the context of the present invention 
30 since, in the case of three fidelity levels of encoding, a 2 bit representation can signal 
which of the three encoding levels has been selected. 

In the preferred embodiment of the invention, after each queue has accepted 48, 
47 or 44 bytes worth of information (depending on overhead), the queue immediately 
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provides the completed cell or packet to the ready to transmit buffer 140. At the same 
time, a 5 byte header is appended (in the case of ATM cells) to each cell so that the 
ready to transmit buffer receives 53 bytes at a time. It is noted that standard ATM cells 
provide 48 payload bytes and 5 header bytes, resulting in, typically, 53 bytes per cell. 
5 For IP cells (i.e., IP packets), header sizes may be adapted to suit any system 
requirements, as known to those skilled in the art. Therefore, the number of bytes 
appended or used for cells comprising IP packets may vary. 

In one embodiment of the invention, the encoders 120 utilize silent suppression 
algorithms to determine whether or not data should be transmitted to respective cell 

10 assembly buffer queues. In this embodiment of the invention, the encoder 120 only 
transmits data to the respective cell assembly buffer queue 132 after a predetermined 
period of silence has elapsed or non-silence data has been received. 

The system 100 of FIG. 1 includes a controller 200 for executing methods 
according to the invention. Specifically, the controller 200 executes methods which 

15 will be described in more detail below with respect to FIGS. 3 and 4 for modifying 
encoder operation and/or cell assembly buffer operation in response to network 
congestion or other factors. Briefly, the controller 200 receives control signals 
indicative of the amount of data within the cell assembly buffer 130 and the ready to 
transmit buffer 140. In response to these control signals, the controller 200 modifies the 

20 operation of the encoder 120 and/or the cell assembly buffer 130 to reduce the amount 
of data representing the respective voice channels such that communication path 
congestion is reduced. Such adaptation necessarily results in a reduced fidelity of the 
representation of the voice data Vj through V N within the system. Specifically, the 
voice data V, through V N is encoded and provided to the communications network in 

25 one of a low fidelity, medium fidelity or high fidelity representation. The high fidelity 
representation of the voice data is preferred and normally provided to the 
communications network. In the event of communication channel congestion or buffer 
over utilization, the fidelity level of one or more voice channels is responsively reduced 
such that at least a low fidelity representation of each voice channel is provided. The 

30 operation of the controller 200 will be discussed in more detail below with respect to 
FIGS. 2-4. 

In one embodiment of the invention, the voice data channels V! through V N are 
multiplexed with a DATA channel. In this embodiment of the invention, the system 
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00 of HO , lncludes a pacto assemMy buffer (ha( ^ a ^ 

Tanspor, entities are provided by the packet assembly buffer 15J to an 
multiplexer 158 via the signal path DC. 

5 .55 „T T mUWPteXer *" ° UtPU ' ° f ' he buffer 

15 and the ready to transmit buffer 140 to form an output stream OUT2 tna, is coupled 

» the communtcations network. I, is noted tha, me output mutoplexer 158 opera Jin a 
weighted round robin fashion, thereby insuHng a, leas, a minimum level of Ja," 
service (QOS) for the voice or audio data provided by the ready to transmit buffer ,40 
Thus, the second cell assembly buffer 155 and output multiplexer 158 form a data ' 
insertion module 150 prov.di„ g for multiplexed da. and voice over a communication 
channel using DTM, JP or other transport entities. 

HO. 2 depicts a high ,eve, Week diagram of a controHer suitable for use in the 

uncdonal imp.emen.auon of the multiple voice charme, encoding system ,00 of FIG . 
w,ll be appreciated by those skilled in the art that the control functions dep i c ,ed 
herein may be implemented by a dedicated confer, such as the controller 200 
depicted above in F.G. 1, or by contro, functions embedded within ,„e various 

2 IT MOre ° VeMhefUnC,iOnSOf,he ~"0a„dce U assemb,ybuffers 
30 of me system .00 of P.O. , may be optionally included within the controller 200 
In this embodiment of the invention, the encoding and eel, assembly functions for each 
voice channel are performed by the controller 200. 

lheCOn,ro,lCT200 '> f H°-2comprisesaprocessor220aswe„a S memo^240 
storing an initial encoding method 300, a retroactive encoding method 400 other 
programs and methods 245 and system parameters 24 6 . The processor 220 cooperates 
w.,h conventional support circuitry 230 such as power supplies, clock circuits, cache 
memory and the ,ike as we,, as circuits tna, assist in executing the software methods 
The processor 220 also cooperates wim input/output (I/O) circuit 2.0 to form an 
interface between the contro, functions implemented by the various methods and the 
tac n o na , e , enKntsoftllesysteml00ofFio , Specifically, the controller 200 of 
HO. 2 provides a plurahty of output signals CE, through CEn (collectively contro, 
signals CE) which are used to contro, the operation of the encoders 120, through 120 
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Additionally, the controller 200 provides an output signal C AB that is used to control the 
operation of the cell assembly buffer 130. 

The subject invention adapts the amount of data used to represent input voice 
data in response to buffer utilization levels and/or network congestion levels. The 
5 invention utilizes two watermarks, B Y and B 2 to represent utilization or congestion 
levels beyond which the amount of data used to represent the voice inputs is reduced. 
Specifically, the effective coding rate of the voice data is reduced as the cell assembly 
buffer queues 132 and/or read to transmit buffer 140 are nearly full, as determined by 
the relationship between the watermarks and the buffer fullness indicators. As 
10 previously noted, a first quantity Q 2 (t) indicates the amount of data or utilization level 
of the ready to transmit buffer 140, while a second quantity Q^t) represents the amount 
_ of data or utilization level of the cell assembly buffer 130. A total quantity Q(t) is equal 

3 to Q^t) + Q 2 (t). 

i The present invention may use either or both of two coding methods, which will 

W 15 be described in more detail below with respect to FIGS. 2 and 3. Briefly, the first 

O 

p coding method is denoted herein as Initio coding and is described in more detail below 

]** with respect to FIG. 2. In this coding method, a decision is made with respect to coding 

O fidelity prior to the assembly of a cell. 

\1 The other distinction between our coding algorithms was the question of when 

20 the coding rate for any particular cell would be decided. It seems natural to decide at 
O the beginning of the assembly of each cell whether or not the queue length (as measured 

by either Q(t) or Q 2 (t)) is above a watermark. This control—deciding on the coding rate 
as the cell is initialized—is referred to as Initio coding. If Q(t) < B! ( or Q 2 (t) < B^, then 
the cell is coded at 32 Kbps. If B! < Q(t) < B 2 (or B x < Q 2 (t) < B 2 ), then the cell is coded 
25 at 24 Kbps. And if Q(t) > B 2 (or Q 2 (t) > B 2 ), then the cell is coded at 16 Kbps. When 
the cell is filled (after 12, 16 or 24 ms, corresponding to cell assembly time for 32 Kbps, 
24 Kbps, or 16 Kbps respectively) it is popped to the transmit queue. 

There is another possible method of deciding, denoted herein as Retro coding. 
All cells can be initially coded at 32 Kbps, the full rate. Then, when they are filled and 
30 ready to be popped to the ready to transmit queue, the queue length can be checked. If 
Q(t) < B t , then the cell immediately leaves the assembly buffer and joins the transmit 
buffer. If B, < Q(t), then the low priority bits are dropped, and coding continues at 24 
Kbps. When the cell is filled with high and medium priority bits (4 ms later), Q(t) is 
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checkeda gai , If Q(0<B i *e„ th ece„ ispop pe dtotheBansmitbllffer 

TmT Pri ° ri ! UU " " " d codi " 8 contaues " 16 Wb- *= ce„ 

f,,led " 1S Popped without further checking. 

To surrunarize, **, and toro codi „ g for ^ ^ 

Q >-based contro,. We emulated with three sets „f watermark levds: B, = 63 B - 68 

ce U s„ ut o f B = 72 cen s ,B,= 5 ^ I = 65ce I1 s,a„ d a 1 s„B,=36,B 2 = 5 4 ce 1Is S;I 
corresponds ,o ,0 ms delay a, L536 Mhps; 54 ce„s corresponds ,o „ m S; 65 

corresponds to ,8 ms; 63 ce„s corresponds ,o 17.5 ms; and 68 ce„s corresponds t o „ 

ms). Wes,m ru a t e <1 a, lt he S econc Utl o„ S assu mi „ gth a tt he re w e reO, l,or4b yt esof 
10 overhead per cell. y 

done *nZT l SPeCiflCa,i °" ™ " 1CaS ' ** ° f ' hC ^ «»" >* 

*» O0M, % oss rate. The proposed voice multipfcxer has a buffer of capaci, y 20 ms 
* 32 K ps codmg and 0 overhead,. This was rhe ,ar g es. delay that we 
» e accep able, .hen combined „i,h other sources of de,a y in the system. Now ,ar g er 
»ffe, obvious,, have ,o„er ,oss than smaUer one, A,so, the ,ower the waterrnaT 
veis, the ,ower the loss, but ,ower watermark levels also ,ead to tower coding rates 
Oess codtng a, 32 Kbps, So our objective is to take the highest watermark levels 
with acceptable loss, and furthermore to understand the sensitivity „, the 
20 performance of the system (codfag raK , , MS raK) ^ ^ J 

(watermark levels, «* vs. Re ,ro coding, Q(t) v , feedback, overhead, and of 

course loading n). 

FIG. 3 depicts a flow diagram of a firs, method of encoding according to me present 
2S " SPeCi T y ' ,heme *" 300 ° mG - 3 ^ rite,he ^''"--„coder 

300 of HG. 3 operates to encode each voice sample a, a leve! commensurate with the 
avadable bandwidth within the system. That is, the method 300 of HG. 3 detent 
pnor to the formation of an ATM eel, or rP packet whether the ATM cell or IP pack!' 

30 dtscussed w,th respect to P.O. !, this determination is made based upon system 

congesuon as indicate, by the amount of data in the ready to transmit buffer ,40 (i.e. 
Q*» and/or the amount of data in the respective ce„ assembly buffer ,30 (Q (,„ 
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The method 300 of FIG. 3 is entered at step 305 when a new ATM cell or IP 
packet is to be formed within the cell assembly buffer 130. The method 300 then 
proceeds to step 315. 

At step 315 a query is made as to whether the amount of data Q 2 (t) within the 
ready to transmit buffer 140 is less than a first watermark level If the query at step 
315 is answered negatively, then the method 300 proceeds to step 325. If the query at 
step 315 is answered affirmatively, then the method 300 proceeds to step 320 where the 
coding mode is set at a high fidelity level, such as 32 Kbps. The method 300 then 
proceeds to step 340. 

At step 325, a query is made as to whether the amount of data in the ready to 
transmit buffer 140 is greater than or equal to a second watermark level B 2 . If the query 
at step 325 is answered negatively, then the method 300 proceeds to step 335 where the 
coding mode is set at a medium fidelity level, such as 24 Kbps. If the query at step 325 
is answered affirmatively, the method 300 proceeds to step 330 where the coding mode 
is set at a low fidelity level, such as 16 Kbps. The method 300 proceeds to step 340. 

At step 340, the encoder 120 receives a sample to be encoded from the A/D 
converter 1 10. As previously discussed, in the three fidelity level embodiment of the 
invention, the A/D converter 1 10 comprises a 4 bit A/D converter where the two most 
significant bits are necessary for a low fidelity encoding, the three most significant bits 
are necessary for a medium fidelity encoding and all four bits are necessary for a high 
fidelity encoding. The method 300 then proceeds to step 342. 

At step 342, the sample is coded and added to an ATM cell or IP packet being 
formed within the cell assembly buffer 130. The method 300 then proceeds to step 345. 

At step 345, a query is made as to whether the cell or packet being formed has 
been filled. That is, a query is made as to whether the payload portion of the ATM cell 
or IP packet being formed is filled such that additional coded samples may not be 
added. If the query at step 345 is answered negatively, then the method 300 proceeds to 
step 340 to receive the next sample to be coded. If the query at step 345 is answered 
affirmatively, then the method 300 to proceeds to step 350, where the cell or packet that 
has been filled is "popped" into the transmit queue. The method 300 then proceeds to 
step 305 where a new cell or packet is started. 

The embodiment of the invention described above with respect to FIG. 3 
envisions an encoding decision made prior to the formation of an ATM cell, IP packet, 
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or other transport entity. I„ one embodiment of the invention, the ATM eel,, IP packet 
or other transport entity „ fomKd ^ to ^ ^ ^ ^ ^ 

of the .nvennon is denoted as a retroactive coding embodiment and is found to be usefu, 
stnee the vast majority of encoding will occur a. a high fidelity leve.. Thus the 
processing required ,o determine whether system congestion is ,00 high for'a particular 
encodmg ,evel (e.g. steps 315 and 325, is avoided. The retroactive coding embodiment 
will now be described with respect to FIG. 4. ™>odiment 

HO. 4 depicts a flow diagram of a second method of encoding according , 0 the 
present invention. 6 

10 The method 400 of FIG. 4 is entered at step 405 when a new ATM cell IP 

pac.ee, or other transport entity is started, me method 400 then proceeds to step 410 
At step 4!0, a sample to be coded is received by the encoder 120 from the A/D 
converter 110. The metbod 400 then proceeds to step 415, where the received sample is 
coded a, a h gh fidelity leve, (such as 32 Kbps) and to step 420 where the coded sample 

^ddedtotheATMcen.rPpac.etorothertransportenti.ybeingformed. Theme! 

4UU then proceeds to step 425. 

A, step 425, a query is made as to whether the ATM cell, IP packet or other 
-spor, »,i,y has been fiUed. If the que,, at step 425 is answered negative*, then the 
method 400 proceeds to step 410 where the „ sample to be coded is received. 
20 query at step 425 is answered affirmatively, then tire method 400 proceeds to step 430 
A. step 430, a query is made as to whether the amount of data in the ready to 
transmit buffer 140 (i.e„ Q 2 (,» „ ,e ss than . n m nteimiak leve , ^ ff ^ ^ 
step 430 is^swered affirmatively, then ti,e method 400 proceeds ,o step 435 where the 
ATM cell, IP packet or other transport entity is popped i„ to the transmit queue within 
the ready to transmit buffer ,40, and to step 405 where a new cell issued. Ifthe 
quety at step 430 is answered negatively, then the method 400 proceeds to step 440 

Th A,Step440 ' ,helo "P ri '' rit >' l ' i '^«taPPedfrom 1 hecodedsam P ,eswi«hin,he 
cd That ,s, assuming tha, the A/D converters , ,0 comprise 4 bit A/D converters and 

30 7 t ^ U0 ^ hyered a ' g ° ri,hm ^ove 

-wo htgh, one medium and one low priority bit), the low priority bi.(s, are dropped 

'eavmg samples effective* coded a, a medium fidelity level. The method 400 then 

proceeds to step 445. At step 445, a samp.e to be coded is received by the encoder ,20 

from the A/D converter 1 10. The method 400 then proceeds to step 450, where the 
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received sample is coded at a medium fidelity level (such as 24 Kbps) and to step 455 
where the coded sample is added to the ATM cell, IP packet or other transport entity 
being formed. The method 400 then proceeds to step 460. 

At step 460, a query is made as to whether the ATM cell, IP packet or other 
transport entity has been filled. If the query at step 460 is answered negatively, then the 
method 400 proceeds to step 445 where the next sample to be coded is received. If the 
query at step 460 is answered affirmatively, then the method 400 proceeds to step 465. 

At step 465, a query is made as to whether the amount of data in the ready to 
transmit buffer 140 (i.e., Q 2 (t)) is less than a second watermark level B 2 . If the query at 
step 465 is answered affirmatively, then the method 400 proceeds to step 435, where the 
ATM cell, IP packet or other transport entity is popped into the transmit queue within 
the ready to transmit buffer 140, and to step 405 where a new cell is started. If the 
query at step 465 is answered negatively, then the method 400 proceeds to step 470. 

At step 470, the medium priority bits are dropped from the coded samples within 
the cell. That is, assuming that the A/D converters 1 10 comprise 4 bit A/D converters 
and that the encoders 120 utilize the layered coding technique described above (two 
high, one medium and one low priority bit), the medium priority bit(s) are dropped, 
leaving samples effectively coded at a low fidelity level. The method 400 then proceeds 
to step 475. 

At step 475, a sample to be coded is received by the encoder 120 from the A/D 
converter 110. The method 400 then proceeds to step 480, where the received sample is 
coded at a low fidelity level (such as 16 Kbps) and to step 485 where the coded sample 
is added to the ATM cell, IP packet or other transport entity being formed. The method 
400 then proceeds to step 490. 

At step 490, a query is made as to whether the ATM cell, IP packet or other 
transport entity has been filled. If the query at step 490 is answered negatively, then the 
method 400 proceeds to step 475, where the next sample to be coded is received. If the 
query at step 490 is answered affirmatively, then the method 400 proceeds to step 435. 

The methods described above with respect to FIGS. 3 and 4 depict the 
representation of audio data in a layered fashion where the 4 bit representation includes 
two high priority, one medium priority and one low priority bit. Low fidelity, medium 
fidelity and high fidelity coding of the samples is achieved by removing both the low 
and medium fidelity bits, only the low fidelity bits or none of the bits, respectively. It 
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will be appreciated by those skilled in the art that more fidelity levels may be provided 
within the system by utilizing an A/D converter having more bits or by other coding 
techniques. 

Per the interview with the inventor, the invention is disclosed within the context 
of voice over ATM. However, the invention is equally applicable, in fact more 
applicable, to voice over internet protocol (IP). The invention is also described within 
the context of a Tl or fractional Tl communication link. The preferred use of the 
invention is within the context of a Tl or fractional Tl since as the data rate of the 
communications channel decreases the usefulness of the invention also tends to 
decrease. The invention uses adaptive digital pulse code modulation (ADPCM), 
however, other encoding techniques may also be used. 

In one embodiment of the invention, the A/D converters 110 of FIG. 1 comprise 
5 bit A/D converters. In this embodiment of the invention, in addition to high, medium 
and low fidelity encoding, a very high level of encoding is also provided. That is, in the 
above-described embodiment of the invention, high, medium and low fidelity encoding 
comprise 32 Kbps, 32 Kbps and 16 Kbps encoding, respectively. In the 5 bit A/D 
converter embodiment, a very high fidelity encoding level of 40 Kbps is also used. It 
will be appreciated by those skilled in the art that the watermark levels B and buffer 
utilization levels Q(t) are adapted to accommodate four levels of encoding rather than 
three levels of encoding. Such adaptation may occur in the initial encoding 
embodiment of FIG. 3 or the retroactive encoding embodiment of FIG. 4. 

In one embodiment of the invention, a Tl line having a data rate of 1.536 Mbps 
was supplied via the system 100 of FIG. 1 using a first watermark level I?! set at 63 
cells and a second watermark level B 2 set at 68 cells (out of 72 cells). 

In operation, the system 100 of FIG. 1 so configured met a design specification 
wherein at least 95% of the voice coding would be done at 32 Kbps, while no more than 
1% of the voice coding would be done at 16 Kbps. Moreover, a specified loss rate of 
less than 0.0001% was achieved. In this embodiment of the invention, the voice 
multiplexer utilized a buffer having a capacity of 20 milliseconds (assuming a 32 Kbps 
coding and 0 bytes of overhead per cell). It will be appreciated by those skilled in the 
art, and informed by the teachings of the present invention, that lowering the watermark 
levels will lower loss but at the expense of requiring lower coding rates for more of the 
voice data. It is noted that the Retro coding scheme described above with respect to 
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FIG. 4 typically experiences less loss and less coding at 16 Kbps. Thus, the Retro 
coding scheme is typically preferable to the initial coding scheme. 

In experiments conducted by the inventor, it was determined that a 20 
millisecond delay within the transmit buffer provided a maximum amount of delay for a 
subjectively " good" quality of service (QOS). 

Improved QOS may be provided by reducing the transmit buffer delay, while 
reduced QOS may be provided by increasing the transmit buffer delay. At Tl data 
rates, a 20 millisecond delay represents approximately 72 cells worth of information. 
Thus, using the all buffers watermarking technique described above, a watermark level 
set at 54 equates to a 15 millisecond delay, while a watermark set at 65 cells equates to 
a 18 millisecond delay at Tl rates. 

Although various embodiments which incorporate the teachings of the present 
invention have been shown and described in detail herein, those skilled in the art can 
readily devise many other varied embodiments that still incorporate these teachings. 
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